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Abstract 

S: 

Q\ \ We study optimal investment in a financial market having a finite number of assets from a signal 

processing perspective. We investigate how an investor should distribute capital over these assets and when 
he should reallocate the distribution of the funds over these assets to maximize the cumulative wealth 
over any investment period. In particular, we introduce a portfolio selection algorithm that maximizes the 
expected cumulative wealth in i.i.d. two-asset discrete-time markets where the market levies proportional 
transaction costs in buying and selling stocks. We achieve this using "threshold rebalanced portfolios", 
where trading occurs only if the portfolio breaches certain thresholds. Under the assumption that the 
relative price sequences have log-normal distribution from the Black-Scholes model, we evaluate the 
expected wealth under proportional transaction costs and find the threshold rebalanced portfolio that 
achieves the maximal expected cumulative wealth over any investment period. Our derivations can be 
readily extended to markets having more than two stocks, where these extensions are pointed out in the 

£IJ ' paper. As predicted from our derivations, we significantly improve the achieved wealth over portfolio 

selection algorithms from the literature on historical data sets. 
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I. Introduction 

Recently financial applications attracted a growing interest from the signal processing community 
since the recent global crises demonstrated the importance of sound financial modeling and reliable data 
processing [1], [2]. Financial markets produce vast amount of temporal data ranging from stock prices 
to interest rates, which make them ideal mediums to apply signal processing methods. Furthermore, due 
to the integration of high performance, low-latency computing recourses and the financial data collection 
infrastructures, signal processing algorithms could be readily leveraged with full potential in financial 
stock markets. This paper particularly focuses on the portfolio selection problem, which is one the most 
important financial applications and has already attracted substantial interest from the signal processing 
community [3]-[7]. 

In particular, we study the investment problem in a financial market having a finite number of assets. We 
concentrate on how an investor should distribute capital over these assets and when he should reallocate the 
distribution of the funds over those assets in time to maximize the overall cumulative wealth. In financial 
terms, distributing ones capital over various assets is known as the portfolio management problem and 
reallocation of this distribution by buying and selling stocks is referred as the rebalancing of the given 
portfolio [8]. Due to obvious reasons, the portfolio management problem has been investigated in various 
different fields from financial engineering [9], machine learning to information theory [10], with a sig- 
nificant room for improvement as the recent financial crises demonstrated. To this end, we investigate the 
portfolio management problem in discrete-time markets when the market levies proportional transaction 
costs in trading while buying and selling stocks, which accurately models a wide range of real life markets 
[8], [9]. In discrete time markets, we have a finite number of assets and the reallocation of wealth (or 
rebalancing of the capital) over these assets is only allowed at discrete investment periods, where the 
investment period is arbitrary, e.g., each second, minute or each day [10], [11]. Under this framework, we 
introduce algorithms that achieve the maximal expected cumulative wealth under proportional transaction 
costs in i.i.d. discrete-time markets extensively studied in the financial literature [8], [9]. We further 
illustrate that our algorithms significantly improve the achieved wealth over the well-known algorithms 
in the literature on historical data sets under realistic transaction costs, as anticipated from our derivations. 
The precise problem description including the market and transaction cost models are provided in Section 
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Determination of the optimum portfolio and the best portfolio rebalancing strategy that maximize 
the wealth in discrete-time markets with no transaction fees is heavily investigated in information theory 



[10], [11], machine learning [12]— [14] and signal processing [15]— [18] fields. Assuming that the portfolio 
rebalancings, i.e., adjustments by buying and selling stocks, require no transaction fees and with some 
further mild assumptions on the stock prices, the portfolio that achieves the maximum wealth is shown to 
be a constant rebalanced portfolio (CRP) [11], [19]. A CRP is a portfolio strategy where the distribution 
of funds over the stocks are reallocated to a predetermined structure, also known as the target portfolio, 
at the start of each investment period. CRPs constitute a subclass of a more general portfolio rebalancing 
class, the calendar rebalancing portfolios, where the portfolio vector is rebalanced to a target vector 
on a periodic basis [8]. Numerous studies are carried out to asymptotically achieve the performance of 
the best CRP tuned to the individual sequence of stock prices albeit either with different performance 
bounds or different performance results on historical data sets [11], [12], [14]. CRPs under transaction 
costs are further investigated in [20], where a sequential algorithm using a weighting similar to that 
introduced in [19], is also shown to be competitive under transaction costs, i.e., asymptotically achieving 
the performance of the best CRP under transaction costs. However, we emphasize that maintaining a 
CRP requires potentially significant trading due to possible rebalancings at each investment period [15]. 
As shown in [15], even the performance of the best CRP is severally affected by moderate transaction 
fees rendering CRPs ineffective in real life stock markets. Hence, it may not be enough to try to achieve 
the performance of the best CRP if the cost of rebalancing outweighs that which could be gained from 
rebalancing at every investment period. Clearly, one can potentially obtain significant gain in wealth by 
including unavoidable transactions fees in the market model and perform reallocation accordingly. 

In these lines, the optimal portfolio selection problem under transactions costs is extensively investigated 
for continuous-time markets [21]-[24], where growth optimal policies that keep the portfolio in closed 
compact sets by trading only when the portfolio hits the compact set-boundaries are introduced. Naturally, 
the results for the continuous markets cannot be straightforwardly extended to the discrete-time markets, 
where continuous trading is not allowed. However, it has been shown in [25] that under certain mild 
assumptions on the sequence of stock prices, similar no trade zone portfolios achieve the optimal growth 
rate even for discrete-time markets under proportional transaction costs. For markets having two stocks, 
i.e., two-asset stock markets, these no trade zone portfolios correspond to threshold portfolios, i.e., the 
no trade zone is defined by thresholds around the target portfolio. As an example, for a market with two 
stocks, the portfolio is represented by a vector b = [b 1 — b] T , b G [0, 1], assuming only long positions 
[8], where b is the ratio of the capital invested in the first stock. For this market, the no rebalancing region 
around a target portfolio b = [b 1 — b] T , b G [0, 1], is given by a threshold e, min{6, 1 — b} > e > 0, such 
that the corresponding portfolio at any investment period is rebalanced to a desired vector if the ratio of 



the wealth in the first stock breaches the interval (b — e, b + e). In particular, unlike a calendar rebalancing 
portfolio, e.g., a CRP, a threshold rebalanced portfolio (TRP) rebalances by buying and selling stocks 
only when the portfolio breaches the preset boundaries, or "thresholds", and otherwise does not perform 
any rebalancing. Intuitively, by limiting the number of rebalancings due to this non rebalancing regions, 
threshold portfolios are able to avoid hefty transactions costs associated with excessive trading unlike 
calendar portfolios. Although TRPs are shown to be optimal in i.i.d. discrete-time two-asset markets 
(under certain technical conditions) [25], finding the TRP that maximizes the expected growth of wealth 
under proportional transaction costs is not solved, except for basic scenarios [25], to the best of our 
knowledge. 

In this paper, we first evaluate the expected wealth achieved by a TRP over any finite investment period 
given any target portfolio and threshold for two-asset discrete-time stock markets subject to proportional 
transaction fees. We emphasize that we study two-asset market for notational simplicity and our derivations 
can be readily extended to markets having more than two assets as pointed out in the paper where 
needed. We consider i.i.d. discrete-time markets represented by the sequence of price relatives (defined 
as the ratio of the opening price to the closing price of stocks), where the sequence of price relatives 
follow log-normal distributions. Note that the log-normal distribution is the assumed statistical model 
for price relative vectors in the well-known Black-Scholes model [8], [9] and this distribution is shown 
to accurately model real life stock prices by many empirical studies [8]. Under this setup, we provide 
an iterative relation that efficiently and recursively calculates the expected wealth over any period in 
any i.i.d. discrete time market. This iterative relation is evaluated using a certain multivariate Gaussian 
integral for the log-normal distribution. We then provide a randomized algorithm to calculate the given 
integral and obtain the expected growth. This expected growth is then optimized by a brute force method 
to yield the optimal target portfolio and threshold to maximize the expected wealth over any investment 
period. Furthermore, we also provide a maximum-likelihood estimator to estimate the parameters of 
the log-normal distribution from the sequence of price relative vectors, which is incorporated into the 
algorithmic framework in Simulations section since these parameters are naturally unknown in real life 
markets. 

Portfolio management problem is studied with transaction costs in [26] on the horse race setting, which 
is a special discrete-time market where only one of the asset pays off and the others pay nothing on each 
period. This basic framework is then extended to general stock markets in [25], where threshold portfolios 
are shown to be growth optimal for two-asset markets. However, no algorithm, except for a special sampled 
Brownian market, is provided to find the optimal target portfolio or threshold in [25]. To achieve the 



performance of the best TRP, a sequential algorithm is introduced in [27] that is shown to asymptotically 
achieve the performance of the best TRP tuned to the underlying sequence of price relatives. This 
algorithm uses a similar weighting introduced in [19] to construct the universal portfolio. We emphasize 
that the universal investment strategies, e.g., [27], which are inspired by universal source coding ideas, 
based on Bayesian type weighting, are heavily utilized to construct sequential investment strategies [3], 
[5], [11], [13]— [18]. Although these methods are shown to "asymptotically" achieve the performance of 
the best portfolio in the competition class of portfolios, their non-asymptotic performance is acceptable 
only if a sufficient number of candidate algorithms in the competition class is overly successful [15] to 
circumvent the loss due to Bayesian type averaging. Since these algorithms are usually designed in a min- 
max (or universal) framework and hedge against (or should even work for) the worst case sequence, their 
average (or generic) performance may substantially suffer [12], [28], [29]. In our simulations, we show 
that our introduced algorithm readily outperforms a wide class of universal algorithms on the historical 
data sets, including [27]. Note that to reduce the negative effect of the transaction costs in discrete time 
markets, semiconstant rebalanced portfolio (SCRP) strategies have also been proposed and studied in 
[12], [15], [20]. Different than a CRP and similar to the TRPs, an SCRP rebalances the portfolio only at 
the determined periods instead of rebalancing at the start of each period. Since for an SCRP algorithm 
rebalancing occurs less frequently than a CRP, using an SCRP strategy may improve the performance over 
CRPs when transaction fees are present. However, no formulation exists to find the optimal rebalancing 
times for SCRPs to maximize the cumulative wealth. Although there exist universal methods [13], [15] 
that achieve asymptotically the performance of the best SCRP tuned to the underlying sequence of price 
relatives, these methods suffer in realistic markets since they are tuned to the worst case scenario [15] 
as demonstrated in the Simulations section. 

We begin with the detailed description of the market and the TRPs in Section |II1 We then calculate the 
expected wealth using a TRP in an i.i.d. two-asset discrete-time market under proportional transaction 
costs over any investment period in Section |IlI] We first provide an iterative relation to recursively 
calculate the expected wealth growth. The terms in the iterative algorithm are calculated using a certain 
form of multivariate Gaussian integrals. We provide a randomized algorithm to calculate these integrals 
in Section IIII-CI The maximum-likelihood estimation of the parameters of the log-normal distribution is 
given in Section [TV] The paper is then concluded with the simulations of the iterative relation and the 
optimization of the expected wealth growth with respect to the TRP parameters using the ML estimator 
in Section [V] 



II. Problem Description 

In this paper, all vectors are column vectors and represented by lower-case bold letters. Consider a 
market with m stocks and let {x(i)}t>i represent the sequence of price relative vectors in this market, 
where x(£) = [xi(t), X2{t), ■ ■ ■ , x m (t)] T with Xi(t) € R + for iG {1,2,..., m} such that Xi(t) represents 
the ratio of the closing price of the ith stock for the fth trading period to that from the (t — l)th trading 
period. At each investment period, say period t, h(t) represents the vector of portfolios such that bi(t) is 
the fraction of money invested on the ith stock. We allow only long-trading such that YliLi bi{t) = 1 an ^ 
bi(t) > 0. After the price relative vector x(t) is revealed, we earn b T (t)x(t) at the period t. Assuming 
we started investing using 1 dollars, at the end of n periods, the wealth growth in a market with no 
transaction costs is given by 

n 

S(n) = l[b T (t) X (t). (1) 



t=i 



If we use a CRP [10], then we earn 



IIb T x(t), 



t=i 



at the end of n periods ignoring the transaction costs. This method is called "constant rebalancing" since 
at the start of each investment period t, the portfolio vector b(t) = [b±(t), 62(^)1 ■ • ■ > b m (t)] is adjusted, 
or rebalanced, to a predetermined constant portfolio vector, say, b = [61, 62, ... , b m ] where YmLi b i = 1- 
As an example, at the start of each investment period t, since we invested using b at the investment 
period t — 1 and observed x(t — 1), the current portfolio vector, say b id(i), 



boidW = 



6ixi(t-l) b m x m (t-l) 



should be adjusted back to b. If we assume a symmetric proportional transaction cost with cost ratio c, < 
c < 1, for both selling and buying, then we need to spend approximately Y^L\ bi t0 id(t)S(t)\bi }0 \d(t) — bi\c 
dollars for rebalancing. Note that if the transaction costs are not symmetric, the analysis follows by 
assuming c = c sc \\ + Cb uy by [20], where c sc \\ and c^, uy are the proportional transaction costs in selling 
and buying, respectively. Since a CRP should be rebalanced back to its initial value at the start of each 
investment period, a transaction fee proportional to the wealth growth up to the current period, i.e., S(t), 
is required for each period t. Hence, constantly rebalancing at each time t may be unappealing for large 
c. 

To avoid such frequent rebalancing, we use TRPs, where we denote a TRP with a target vector b and 
a threshold e (with certain abuse of notation) as "TRP with (b, e)". For a sequence of price relatives 
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Fig. 1: A sample scenario for threshold rebalanced portfolios. 
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vectors x n = [x(l), x(2), . . . , x(n)] with x G R+ , a TRP with (b, e) rebalances the portfolio to b at the 
first time r satisfying 



£ [ b j- e j> b j + e j] 



(2) 



Y,k=i b kllt=i x k(t) 

for any j £ {1,2, ... , m}, thresholds ej, and does not rebalance otherwise, i.e., while the portfolio vector 
stays in the no rebalancing region. Starting from the first period of a no rebalancing region, i.e., where 
the portfolio is rebalanced to the target portfolio b, say t = 1 for this example, the wealth gained during 
any no rebalancing region is given by 

m n 

W(x> n eC)=X>fcIIa*(i), (3) 

fe=i t=i 

where b n = [b(l),b(2), . . . , b(n)] with b(t) is the portfolio at period t and £^ c is the length n no 
rebalancing region defined as 



£™ = {h n | b(l) = b,bj(t) G (bj - e it bj + e,),j £ {1,2, ... ,m},t e {1,2,. . . ,n}}. 



(4) 



A TRP pays a transaction fee when the portfolio vector leaves the predefined no rebalancing region, i.e., 
goes out of the no rebalancing region £° c , and rebalanced back to its target portfolio vector b. Since the 
TRP may avoid constant rebalancing, it may avoid excessive transaction fees while securing the portfolio 
to stay close the target portfolio b, when we have heavy transaction costs in the market. 



For notational clarity, in the remaining of the paper, we assume that the number of stocks in the market 
is equal to 2, i.e., m = 2. Note that our results can be readily extended to the case when m > 2. We 
point out the necessary modifications to extend our derivations to the case m > 2. Then, the threshold 
rebalanced portfolios are described as follows. 

Given a TRP with target portfolio b = [b, 1 — b] T with b G [0, 1] and a threshold e, the no rebalancing 
region of a TRP with (b, e) is represented by (6 — e, b + e). Given a TRP with (b — e, b + e), we only 
rebalance if the portfolio leaves this region, which can be found using only the first entry of the portfolio 
(since there are two stocks), i.e., if 6i )0 id(*) £ (b — e,b + e). In this case, we rebalance 6i,oid(*) to b. 
Fig. Q] represents a sample TRP in a discrete-time two-asset market and when the portfolio is rebalanced 
back to its initial value if it leaves the no rebalancing interval. 

Before our derivations, we emphasize that the performance of a TRP is clearly effected by the threshold 
and the target portfolio. As an example, choosing a small threshold e, i.e., a low threshold, may cause 
frequent rebalancing, hence one can expect to pay more transaction fees as a result. However, choosing 
a small e secures the TRP to stay close to the target portfolio b. Choosing a larger threshold e, i.e., 
a high threshold, avoids frequent rebalancing and degrades the excessive transaction fees. Nevertheless, 
the portfolio may drift to risky values that are distant from the target portfolio b under large threshold. 
Furthermore, we emphasize that proportional transaction cost c is a key factor in determining the e. Under 
mild stochastic assumptions it has been shown in [11], [19] that in a market with no transaction costs, 
CRPs achieve the maximum possible wealth. Therefore in a market with no transaction costs, i.e., c = 0, 
the maximum wealth can be achieved when we choose a zero threshold, i.e., e = and a target portfolio 
b* = arg max i£ [log (fori + (1 — 6)^2)], where x\ and X2 represent the price relatives of two-asset market 

b 

[19]. On the other hand, in a market with high transaction costs, choosing a high threshold, i.e., a large 
e, eliminates the unappealing effect of transaction costs. For instance, for the extreme case where the 
transaction cost is infinite, i.e., c = 00, the best TRP should either have e = 1 or b € {0, 1} to ensure 
that no rebalancing occurs. 

In this paper, we assume that the price relative vectors have a log-normal distribution following the well- 
known Black-Scholes model [8]. This distribution that is extensively used in financial literature is shown 
to accurately model empirical price relative vectors [30]. Hence, we assume that x(t) = [x\(t), X2(t)] T 
has an i.i.d. log-normal distribution with mean fi = [fii,^] an d standard deviation a = [01, 02], 
respectively, i.e., x(t) ~ ln./VQu, a 2 ). Here, we first optimize the wealth achieved by a TRP for the 
discrete-time market, where the distributions of the price relatives are known. We then provide a ML 
estimator for these parameters to cover the case where the means and variances are unknown. The ML 



estimator is incorporated in the algorithmic framework in the Simulations section since the corresponding 
parameters are unknown in real life markets. The details of the maximum-likelihood estimation are given 
in Section HVl 

III. Threshold Rebalanced Portfolios 

In this section, we analyze the TRPs in a discrete-time market with proportional transaction costs as 
defined in Section JU We first introduce an iterative relation, as a theorem, to recursively evaluate the 
expected achieved wealth of a TRP over any investment period. The terms in this iterative equation are 
calculated using a certain form of multivariate Gaussian integrals. We provide a randomized algorithm 
to calculate these integrals. We then use the given iterative equation to find the optimal e and b that 
maximize the expected wealth over any investment period. 

A. An Iterative Relation to Calculate the Expected Wealth 

In this section, we introduce an iterative equation to evaluate the expected cumulative wealth of a TRP 
with (b — e, b + e) over any period n, i.e., E[S(n)]. As seen in Fig. [2j for a TRP with (6 — e, b + e), 
any investment scenario can be decomposed as the union of consecutive no-crossing blocks such that 
each rebalancing instant, to the initial b, signifies the end of a block. Hence, based on this observation, 
the expected gain of a TRP between any consecutive crossings, i.e., the gain during the no rebalancing 
regions, directly determines the overall expected wealth growth. Hence, we first calculate the conditional 
expected gain of a TRP over no rebalancing regions and then introduce the iterative relation based on 
these derivations. 

For a TRP with (b—e, b+e), we call a no rebalancing region of length n as "period n with no-crossing" 
such that the TRP with the initial and target portfolio b = [b, 1 — b] stays in the (b — e, b + e) interval 
for n — 1 consecutive investment periods and crosses one of the thresholds at the nth period. We next 
calculate the expected gain of a TRP over any no-crossing period as follows. 

The wealth growth of a TRP with (b — e, b + e) for a period r with no-crossing can be written as \J 

T T 

Snc(r) = bl[[xi(t)] + (1 - b) l[[x 2 (t)}, (5) 

4=1 t=l 

without the transaction cost that arises at the last period. To find the total achieved wealth for a period r 
with no-crossing, we need to subtract the transaction fees from ((5]). If portfolio bi(t) crosses the threshold 

'This is the special case of ((3} for rn — 2. 
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at the investment period t = r, then we need to rebalance it back to b, i.e., b\(t) = b and pay 



Snr.Mc 



(6) 



bWt=i(xi(t)) + (l-b)n T t=i(*2(t)) 
where c represents the symmetrical commission cost, to rebalance two stocks, i.e., &i )0 id( T + 1) to b, and 
&2,oid( T ' + 1) = 1 — bi t0 id(T + 1) to 1 — b. Hence, the net overall gain for a period r with no-crossing 
becomes 

bULMd)) 



S(r) = S nc (r) - S nc (r)c 

T 

= bH[x 1 (t)] + (l-b)l[[x 2 (t)}-c(b-b 
t=l t=l 

T T 



bIVt=i(xi(t)) + (l-b)lVt=i(x2(t)) 



i[[xi{t)]-ii[x2<it)] 



t=\ 



t=\ 



(7) 



t=l 



t=\ 



where & = b - 2c(b - b 2 ) and C2 = 1 - b + 2c(6 - b 2 ) for b + e hitting and £i = & + 2c(6 - b 2 ) and 
<^2 = 1 — b — 2c(b — b 2 ) for b — e hitting. Thus, the conditional expected gain of a TRP conditioned 
on that the portfolio stays in a no rebalancing region until the last period of the region can be found 
by calculating the expected value of (0. Since, we now have the conditional expected gains, we next 
introduce an iterative relation to find the expected wealth growth of a TRP with (b — e, b + e) for period 
n, E[S(n)], by using the expected gains of no-crossing periods as shown in Fig. [2l 

In order to calculate the expected wealth E[S(n)] iteratively, let us first define the variable R(t), which 
is the expected cumulative gain of all possible portfolios that hit any of the thresholds first time at the 
rth period, i.e., 

fc 



R(t) = E S(t 



b T G El 



(8) 



where £* denotes the set of all possible portfolios with initial portfolio b and that stay in the no rebalancing 
region for r — 1 consecutive periods and hits one of the b — e or b + e boundary at the rth period, i.e., 



A 



4 C = {b r G B T (b, e) I 6(1) = b, b(i) G (b - e, b + e) V* G {2, . . . ,r - 1}, 6(r) i (b - e, b + e)}. (9) 

Here, B T (b, e) is defined as the set of all possible threshold rebalanced portfolios with initial and target 
portfolio b and a no rebalancing interval (b — e, b + e). Similarly we define the variable T(t), which is 
the expected growth of all possible portfolios of length r with no threshold crossings, i.e., 



T{r) 



E 



S{t) b T G £ T nc 



(10) 
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Fig. 2: No-crossing intervals of threshold rebalanced portfolios. 



where £" c denotes the set of portfolios with initial portfolio b and that stay in the no rebalancing region 
for r consecutive periods, i.e.o 



A 



£™ = {b T e B T (b, e) I b{l) = b, b(i) € [b - e, b + e] Vi G {2, . . . , r}}. 



(11) 



Given the variables R(t) and T(t), we next introduce a theorem that iteratively calculates the expected 
wealth growth of a TRP over any period n. Hence, to calculate the expected achieved wealth, it is sufficient 
to calculate R(r), T(t), threshold crossing probabilities P (b n G £*) and P (h n € £n C ), which are 
explicitly evaluated in the next section. 



Theorem 3.1: The expected wealth growth of a TRP (b — e,b + e), i.e., E[S(n)], over any i.i.d. 
sequence of price relative vectors x ra = [x(l),x(2), . . . ,x(n)], satisfies 



E[S(n)} = £ P(£! c )R(i)E[S(n - i)] + P(£™)T(n), 



(12) 



i=l 



where we define S = 1, R{n) in ©, T{n) in CQ, £f in CD and £™ in ©. 



2 This is the special case of the definition in Q for m = 2. 
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We emphasize that by Theorem 13.11 we can recursively calculate the expected growth of any TRP over 
any i.i.d. discrete-time market under proportional transaction costs. Theorem 13. 1 1 holds for i.i.d. markets 
having either m = 2 or m > 2 provided that the corresponding terms in (ITZi can be calculated. 

Proof: By using the law of total expectation [31], E[S(n)] can be written as 

E[S(n)} = [ E[S(n)\b n ]P(b n )db n , (13) 

where B n (b,e) is defined as the set of all possible TRPs with the initial and target portfolio b and 
threshold e. To obtain (fT2l . we consider all possible portfolios as a union of n + 1 disjoint sets: (1) the 
portfolios which cross one of the thresholds first time at the 1st period; (2) the portfolios which cross 
one of the thresholds first time at the 2nd period; and continuing in this manner, (3) the portfolios which 
cross one of the thresholds first time at the nth period; and finally (4) the portfolios which do not cross 
the thresholds for n consecutive periods. Clearly these market portfolio sets are disjoint and their union 
provides all possible portfolio paths. Hence ([T"3l can also be written as 



n „ 

E[S(n)} = V / E[S(n)\h\ G S^, b? +1 G B n ^(b, e)]P(b[ G Sf c , bf +1 G B n ^(b, e))db* 

i=i Jb\&£l c ,K +1 aBn-i{b,e) 

+ / E[S{n)\h n G £™]P{b n G C)db n , (14) 

Jb"e£° c 



where bj = [b(i),b(i + 1), . . . ,b(j)]. To continue with our derivations, we define S^j as the wealth 
growth from the period i to period j, i.e., Si-+j = -^M. Assume that in the period r, the portfolio 
crosses one of the thresholds and a rebalancing occurs. In that case, regardless of the portfolios before 
the period r, the portfolio is rebalanced back to its initial value in the rth period, i.e., to [b, 1 — b] T . 
Since the price relative vectors are independent over time, we can conclude that the portfolios before 
the period r are independent from the portfolios after the period r, i.e., 6(r) = b and every portfolio 
b(i) for i G {1, 2, . . . , r — 1} are independent from the portfolios b(j) for j G {r + 1, r + 2, . . . , n}. 
Hence, the investment period where the portfolio path crosses one of the thresholds, i.e., r, divides the 
whole investment block into uncorrelated blocks in terms of price relative vectors and portfolios. Thus, 
the wealth growth acquired up to the period r, Si^ T , is uncorrelated to the wealth growth acquired after 
that period, i.e., S T+ i^. n . Hence, if we assume that a threshold crossing occurs at the period r, then we 
have 



E 



S(n)\b[ G 4 C X +1 G B n . T (b,e) = E SW^r+i-^lbT G 4°X +1 G B n - T (b,e) 



E 



S 1 ^ T \b\ G £f c ] E [Sr+^nlb^ G B n -i(b, e)] . (15) 
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Applying (TT5T ) to (fl4l ). we get 

Si-rtlbi g 4 C ] ^ [S i+ i_>„|&(i) = 6,b™ +1 G B„_i(6,€)] P (M g £f) 

P(b? +1 eB n ^{b, e))db n + f E[S(n)\b n ££* c }P(b n eS* c )db n . (16) 



E[S(n)]=J2 / s 

i=1 >/bi6£| ,bt' +1 6B n _ i (6,e) 



V / £[Si-*|bj G £f c ]P(bi G 4 c )dbi / P[S m ^ n |6(i) = 6, b? +1 G £ n _ 4 (6, e)] 

(17) 



Since the integral in ([ToT i can be decomposed into two disjoint integrals, (fT4l ) yields 
£7[S(n)] 

x P(b™ +1 G Bn-^6, e))db? +1 + f E[S(n)\b n G f^]P(b n G 5^)db n . 

We next write (fTTT l as a recursive equation. 

To accomplish this, we first note that 
(i) R(i) is defined as the expected gain of TRPs with length i, which crosses one of the thresholds first 
time at the i-th period and it follows that 



R(i) = E S(t 

1 



b' G £? } 

E[S^i\h[ G £\ c ]P(b\ G £l c )db\, 



(18) 
(19) 



where we write P(£f c ) instead of P(b\ G £f c ). 

(ii) Then, as the second term, T(n) is defined as the expected gain of TRPs of length n, which does not 

cross one of the thresholds for n consecutive periods. This yields 



T(n) = E S(n) 



b n G C 



E[S(n)\b n G £™}p(b n G £^)db n . 



(20) 
(21) 



P{£-n C ) Jh^eS" 
(iii) Finally, observe that the second integral in (fTTT l is the expected wealth growth of a TRP of length 

n — i, i.e., 

E[S(n -»)]=/ E[S i+1 .+ n \b(i) = b,b? +1 G B n ^(b,e)]p(bf +1 G B n _ i (6,e))db? +1 , (22) 

where p(h™ +l G B n -i(b, e)) = 1 by the definition of the set B n -i(b, e). 
Hence, if we apply (fl9l , (1211 and (f22]> to ([17]), we can write (HU) as 



E[S(n)\ = J2 P{£l c )Rii)E[S{n - i)] + P(£» c )T(n), 
hence the proof concludes. 



(23) 



i=X 



14 



Theorem 13. 1 1 provides a recursion to iteratively calculate the expected wealth growth E[S(n)], when 
R{t) and T(t) are explicitly calculated for a TRP with (6— e, b+e). Hence, if we can obtain P (££ c ) R(t) 
and P (£" c ) T(t) for any r, then (fT2l ) yields a simple iteration that provides the expected wealth growth 
for any period n. We next give the explicit definitions of the events £— and £" c in order to calculate 
the conditional expectations R(t) and T(t). Following these definitions, we calculate P (£^ c ) R{t) and 
P (££ c ) T"(r) to evaluate the expected wealth growth E[S(t)], iteratively from Theorem 13. II and find the 
the optimal TRP, i.e., optimal b and e, by using a brute force search. 

In the next section, we provide the explicit definitions for £* and ££ c , and define the conditions for 
staying in the no rebalancing region or hitting one of the boundaries to find the corresponding probabilities 
of these events. 

B. Explicit Calculations of R(n) and T(n) 

In this section, we first define the conditions for the market portfolios to cross the corresponding 
thresholds and calculate the probabilities for the events £™ and £° c . We then calculate the conditional 
expectations R(n) and T(n) as certain multivariate Gaussian integrals. The explicit calculation of mul- 
tivariate Gaussian integrals are given in Section IIII-CI 

To get the explicit definitions of the events £*. c and £ £ c , we note that we have two different boundary 
hitting scenarios for a TRP, i.e., starting from the initial portfolio b, the portfolio can hit b — e or b + e. 
From b, the portfolio crosses b — e boundary if 

b UU(Mt)) <b _ e 

bWt=i(xi(t)) + (i-b)U T t=1 (Mt))- ' l ; 

where r is the first time the crossing happens without ever hitting any of the boundaries before. Since 
xi(i),X2(i) > for all i, d24l) happens if 

l\ Xl (t)-(l-b)(b-ey ^ 

which is equivalent to 

n 2 (r)>7in 1 (r), 

where ili(i) = rit=i x i(*)> ^(i) = rit=i x 2(£) and 71 = (1 _M(fclg\ - Since x(i)'s have log-normal 
distributions, i.e., x(t) ~ lnA/"(/x, a 2 ), H\(i) and 112(2) are log-normal, too [31]. Furthermore, to calculate 
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the required probabilities, we have 

p(ni(*),ni(fc - i),ni(fc)) =p(ni(i),ni(*; - i)) P (iii(fc)|ni(A; - i),ni(0) 

= p(ni(»)) P (n 1 (A; - ljinicojpcn^fe - i)xi(fc)|ni(fc - i),ni(i)) 
= p(ni(»))p(n 1 (* - i)|ni(i))p(ni(fc)|ni(A: - 1)) , (26) 

Vi £ {0, 1, . . . , k — 2}, where (l26l ) follows since a; (A;) is independent of Ili(i) for k > i. Hence IIi(z)'s 
form a Markov chain such that Ili(i) O Iii(k — 1) o n 1 (A;) Vi G {0, 1, . . . , A; — 2}. Following the 
similar, steps we also obtain that IT^) <->• II2 (A; — 1) <-> H^A;) , Vi € {0, 1, . . . , A; — 2}. We point out that 
by extending the definitions 111 and II2 one can obtain III, II2, . . . , n m for the case m > 2. Furthermore, 
taking the logarithm of both sides of (|25l l we have 

T 

A 



El = 5>(t)>0i, 



t=l 
where z(t) = In ( ^44 J and 9\ = In ,±_ b \, b _L = In 71. The partial sums of z(t)'s are defined as 
S^ = ^2 t=i z(t) for notational simplicity. Since x(t) ~ lnjV(/Lt, <r 2 ), z(t)'s are Gaussian, i.e., z(£) ~ 
A/"(/U, a -2 ), where \i = Hi — p,\ and cr 2 = <r 2 + <r 2 , their sums, Ef's, are Gaussian too. Furthermore note 

that, £* = Etx *(*) = Ej=i ^ (g$) = In {uU g$) = In g$}. 

Similarly with an initial value b, market portfolio crosses b + e boundary if 

6IILi(*i(*)) >6 + e (27) 

fcn t T =1 (xi(t)) + (i-6)ni =1 (x 2 (t))- + ' l ; 

where r is the first crossing time without ever hitting any of the boundaries before. Again, since 
x\(i),X2(i) > for all i, (|27T ) happens if 

frX 2 {t) 6(1-6-6) 

^(i) -(l-6)(6 + e )' Ubj 



which can be written of the form 



Equation (128b yields 



n 2 (t)<72rii(£). 

r 



A , b(i-b-e) 



t=i 

where # 2 = In ( ^" 6 ~+e) = ln ^- 

Hence, we can explicitly describe the event that the market threshold portfolio (6 — e, 6 + e) does not 
hit any of the thresholds for r consecutive periods, £° c , as the intersection of the events as 

A 

1=1 4=1 



£ T nc = f| {si g [5 2) <?i]} = f| {7 2 nx(i) < n 2 (») < 7ini«}. (29) 
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Similarly, the event of the market threshold portfolio (b — e, b + e) hitting any of the thresholds first time 
at the r-th period, £*, can be defined as the intersections of the events 



T-l 



£ l T C = f| {SI e [02,9^} fl [{S T e [-00,02)} Ul^r G (01, OO]} 

j=l 

T-l 

= fl { 72 ni(i) < n a (») < 7ini(i)} f| [{n 2 (r) > 7 ini(r)} J{n 2 (r) < 72 ni(r)} 



(30) 



i=l 



yielding the explicit definitions of the events ^ c in (1301 and £" c in ( f29l ). The definitions of £^ c and £* c 
can be readily extended for the case m > 2 by using the updated definitions of IIi, II2, . . . , n m . 

Since we have the quantitative definitions of the events £~ and £y c , we can express the expected 
overall gain of portfolios with no hitting over r-period, T{t), as 

t(t) = e[s(t) \ e?[ 

T T 

= E[bH[ Xl (t)] + (l-b)ll[x 2 (t)}\£^ 
t=i t=i 

= £[hIi(t) + (1 - 6)H 2 (r) J £* c ] • (31) 



The expectation £ MIi(t) + (1 - 6)n 2 (r) 



SI 



E 



6ni(r) + (l-6)n 2 (r) 



e 



can be expressed in an integral form as 

(fori + (1 - 6)tt 2 ) 

E™\ dvr.dvn 



^0 



xP(n 1 (r) = 7ri,n 2 (r)=7r 2 



(32) 



by the definition of conditional expectation. To extend this for the case m > 2, the double integral 
in the definition of T T d32l ) is replaced by an m-dimensional integral over updated random variables 
IIi, n 2 , . . . , n m . Combining (1321 and (13Tb yields 



/■OO f'OO . 

T(t) = (tmi + (1 " b)7r 2 ) P (ni(r) = tti, II 2 (t) = vr 2 



P(£ 
xPfs. 



I /"00 /"OO 

— r / / (6tti + (1 - 6)7r 2 ) P(n 1 (r) = 7Ti,n 2 (r)=7r 2 ) 



Ili(r) = 7ri,II 2 (r) = 7r 2 ) d7r 2 d7Ti 



P(B\A)P(A) 



(33) 



by Bayes' theorem that P(A\B) = - L^ ■ If we write the explicit definition of £^ c given in d29l ). 
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then we obtain 

P (8?) T(r) = 



OO /"OO 



'0 JO 

,...,7 2 ni(r)< Uo(r) £-jlli(r)| Llj(r)- rr,. i.l : >(r) - ;r ; > 

fOO / , 7l7Tl 



(6tti + (i - 6)7r 2 ) p(ni(r) = 7n,n 2 (r) = 7r 2 )p[72ni(i) < n 2 (i) < 7 ini(i) 

d7r 2 d7Ti 



/■OO /"7i7Ti 

/ / (6tti + (1 - 6)7r 2 ) P(n 1 (r) = 7r 1 ,n 2 (r) = 7r 2 ) 



X P 



72 



7Tl 



< 



7T 2 



< 



71- 



TTl 



,72^ 



7Tl 



< 



7T2 



< 



71- 



7Tl 



111=2^1 (*) rE=2 aj 2(*) rE=2 a; i(*)' nr=s^i(*) nLs^w rcu^i® 

d7r 2 dvri (34) 



7Tl 7T 2 7Tl 

'72 — FT ^ — TT ^ 71 



'xi(r) z 2 (r) xi(r) 

where (l34l follows by the definitions of Ili(i) and II 2 (i), i.e., ITi(i) = nt=i x i(^) = rr — 1 (t) anc ^ 
1^2 (z) = rifci x 2(i) = t-tt r ,-, v If we rearrange the inequalities in (l34l to put the product terms 
together, which does not affect the direction of the inequality since all terms are positive, then we obtain 



P (£n T(r) 



OO / , 7l7Ti 



(fori + (1 - 6)vr 2 ) P(IIi(t) = 7ri,n 2 (r) = tt 2 )P 



^7 2 7T 



![L. < TT x a(t) < tt 2 



L7T171 



t=2 



iCl(0 7Ti7 2 ' 



7T2 



< 



n 



2?2 (t) vr 2 



7T 2 X 2 (,TJ 7T 2 



"I.')! " Wl(t) 7Ti7 2 ' " " ' 7T171 X! (r) 7Ti7 2 



d7r 2 d7Ti 



/■OO /•7i7Ti . 

/ / (fori + (1 - 6)tt 2 ) P(U 1 (T)=7r 1 ,U 2 (T)=7r 2 )P(^ T 2 e[ K -e 1 , K -e 2 },^ T 3 G[ K -e 1 , K -e 2 ], 

JO J-J2TT1 

. , ££ G [K — 9\,K — 2 ] ) d7T 2 d7Ti, 



(35) 



/I 



which follows from the definition of T^ where n = In -j^-. The first probability in (l35l l can be calculated 

as 



P (IIi(t) = 7Ti, n 2 (r) = vr 2 ) = P (IIi(t) = vn) P (n 2 (r) = vr 2 ) 



1 



(ln£]-T£ir 



(Icir 2 -Tf2> 



TTiy^TTTaf 



A 



+ 



TTiy/^K 



(36) 



TO"o 



which follows since Ili(r) = n*=i x i(*) an d n 2 (r) = ni=i x 2(£) and we have Ili(r) ~ lnA/"(rjUi,rcrf) 
and n 2 (r) ~ ln7V(r^ 2 ,rcj 2 ). The corresponding terms in (l35l are written as a multi variable integral 
calculated in Section IIII-CI 

Following similar steps, we can obtain the expected overall gain R(t) as 

R(r) = E [S(r) I £*" 

T T T T 

•n^iW] + a - 6) n^(t)] - 2 C (6 - 6 2 )i ij[xi(*)] - n[xa(t)]i | ^ 

t=l t=l t=l t=l 



p 



(37) 



18 



The conditional expectation E S(t 



£. 



fc 



can also be expressed in an integral form as 



E 



S(r) 



-fc 



JO 



S(r) P n x (r) = TTi , E 2 (r) = tt 2 £* dvrsdvn , 



(38) 



which follows from the definition of conditional expectation. Combining (|38T ) and (I37T ) yields 



£l c ) dvr 2 d7n 



/■OO /'OO - 

R(t) =11 S(r) P (ni(r) = TTi, n 2 (r) = n 2 

1 /'OO /"OO 

= P~W)J Jo S ^ P( ^^ = 7T ^ U ^ = ^ 



xPl£. 



.-fc 



il^r) = 7Tl,n 2 (r) = 7T 2 ) d7T 2 d7Tl 



(39) 

where (l39l follows from the B ayes' theorem. Note that the definition of R(t) (l39l can be extended for 
the case m > 2 by replacing the double integral with an m-dimensional integral over the updated random 
variables Ei, n 2 , . . . , E m . If we replace the event ££ c with its explicit definition in (l30b . then we get 



-fc 



OO POO 



p n:r } B(t) - J I (Citti + c 2 7r 2 ) P(ni(r) = vn,n 2 (r) = vr 2 )p 72 ni(i) < n 2 (i) < 7iEi(i),. 



72ni(r - 1) < n 2 (r - 1) < 7l ni(r - l),7ini(r) < n 2 (r) ^(t) = 7Ti,n 2 (r) = vr 2 
(Cstti + C 4 vr 2 ) P (Ei(r) = tti, E 2 (r) = vr 2 ) pkn^l) < E 2 (l) < 7 iEi(l), . 



d7T 2 d7T 



JO 



72Ei(r - 1) < n 2 (r- 1) < 7lEi(r - l),7 2 Ei(r) > E 2 (r) E^r) = 7Ti,E 2 (r) = vr 2 



d7T 2 d7Tl, 



(40) 



where & = b - 2c{b - b 2 ), C 2 = 1 - b + 2c(b - b 2 ) , Cs = b + 2c{b - b 2 ) and U = 1 - b - 2c{b - b 2 ). 
We next calculate the first integral in (|40l and the second integral follows similarly. 

By the definitions of Ei(i) and E 2 (i), we have Ei(i) = II^i^iX*) = rp — l (t) anc ^ ^ 2 (i) = 
FC=i x 2(t) = ttf (+\ > hence the first integral in (1401 can be written as 

1 it— i+l X 2\t) 



(ClTTl + C27T 2 ) P(El(r) = TTI, E 2 ( T ) = 7T 2 )P 



72 



^7i7ri 
7Tl 



72 



7T1 



< 



7T2 



^7ittt 



7!"1 



< 



7T2 



^71 TTF 



71"! 



nLs^iW nLs^w nr=3 a:; i( i ) 



[1^=2^1 (<) Ut=2 X 2(t) 111=2 ^iW 

d7T 2 d7Tl. (41) 



7T1 7T 2 7Ti 

,72 TT ^ TT ^ 71 



'ici(r) x 2 (t) ici(r). 
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If we gather the product terms in (1411 into the same fraction, then we obtain 

f°° r (Civri+C 2 vr 2 ) P(n 1 (r)=7ri,n 2 (r)=7r 2 )p[-^-<n^< 
Jo -A**, L7r i7i ttxitt) 



'0 ^71771 

vr 2 < rr z 2 (*) 7r 2 7t 2 x 2 (r) 7r 2 



7T171 ^^lW 7r l72' 

_ I | ,,, - - , , _ d7T 2 d7Ti (42) 

71-171 £=3 Xl W 7I "l72 7T171 ^l( r ) 7T172-I 

/•QO /•CX3 , 

= / / (Civri + C 2 vr 2 ) P(ni(r)=7ri,n 2 (T)=7r2)P(S5e[AC-5 lj /c-e 2 ],E5e[«-ei,«;-fc], 

JO J-ym! v 

...,Ej;e[«-0i,K-^ 2 ])d7r 2 d7ri, (43) 

which follows from the definition of E^ where k = In 212 -. Following similar steps that yields (l43l . we 
can calculate d40l ) as 

p(sfjR(r) = / (Ci7Ti + C27T 2 ) P(n 1 (r) = 7r 1 ,n 2 (r)= 7 r 2 )p(sJ 3 e[«-^i,«-0 2 ], 



'7: 

EJ G [k - d, k - 2 ] Sit [/,■ - (>i . /,■ - ()■>} j d^Jrr, 

coo /•7 2 7r 1 



+ / / (Csvri + C^) P(n 1 (r)=7ri,n 2 (r)=7r 2 )P (EJ; G [« - 0i,s- 2 ], 

JO JO v 

El G [k - 0i,K - 2 ], ■ ■ ■ , K G [k - 0i, K - e 2 }) d^dvn, (44) 



where the probability P(IIi(t) = 7Ti,n 2 (r) = 7r 2 ) can be obtained via (l36l) . Hence to calculate 
P(£^ c )T(t) and P (£ lc ) P(r), we need to calculate the probability pfe 6 [k - x ,k - 2 ],E£ G 

[k - 01, K - 2 ], . . . , X£ € [k - 0i, K - 02]) in (El and ©. 
Following from the definition of E^s, we have 

p(E t *, Ef +lJ Ej) = p(E* +1 , E>(Ef |Ef +1 , Sj) 



= p(Ej>(Ef +1 |Sf )p(Ef +1 + z(i)|S? +1) E$) 

= p(E>(Ef +1 |E>(E*|E* +1 ) (45) 

V* G {0, 1, . . . , fc — 2}, where (I43T ) follows since z(z) is independent of E| for j > i. Then, E^'s form 
a Markov chain such that E^ <H> Ef +1 «-»■ E* Vz G {0, 1, . . . , fc — 2} and j > i Hence, we can write the 
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probability 

p(y? 2 g [« - e u k - e 2 ], ?>l e [k - e u K - e 2 ], . . . , K g [« - e u K - o 2 ] 

pK—62 rn—9 2 pK-—02 

= / ••• / P(E>t = si)^_! = s 2 ,...,E£ = s r _i) ds T _ids r _2 ■ ■ ■ dsi 

Jk-6 1 Jk-9i Jk-0! 

pK—62 pK—62 rK—Qo 

= / .../ P(S5 = s t _ 1 |S5 = St _ 2 )P(S5 = St „ 2 |SJ = St _ 3 )... 

Jk-0! Jk-B-l Jk-6! 

P{K-X = *2\K = si)P(K = ai) d Sr _id Sr - 3 • • • ds 2 d Sl , (46) 

where (l46l l follows by the chain rule and Ej's form a Markov chain. We can express the conditional 
probabilities in (l46l i. which are of the form P(T>J = s r _j|E[ +1 = s T _j_i), as 

P (^1 = s r-i+l|S[ +1 = S T _j) = P(E i+1 + z(») = S T _j + l|S[ +1 = S r _j) 

= P(s T -i + z(i) = s T _ i+ i|S[ +1 = s T _j) 

= -T (z(zj = S r _j_|_i — S r _j|Zjj +1 = <S T -i) 

= P(z(z) = s T _ i+1 - s r _i) (47) 

where (|47"T ) follows from the independence of z(i) and z(&)'s for i < fe < r or the independence of 
z(i) and EL. x = X]fc=i+i #(&)< If we replace (|47"T ) with the conditional probabilities in (l46l ) and use 
P(E£ = si) = P(^(r) = si), then we obtain 

pfe G [k - 6>i, k - 2 ], E| G [k - 0i, K - 6 2 ], . . . , E; G [as - 01, « - 2 ] 

[■K — O2 fK — 82 PK—02 

= / ••• / fz(sr-i -s T -2)f z (s T -2 - Sr-3) . . . f z (s2 - si)/ 2 (si) ds T _ids r _ 2 . . . ds 2 d,si 

J K—Q\ J K—9i J K—9i 

rK—62 pK—62 pit— 62 1 

= / / .../ (-^)^ i e ^^- 1 ( s '- s '- 1 -^ 2+ ( Sl -^ 2 d Sr _ 1 d ST _ 2 ...d S2 d Sl , (48) 

Jk-6i Jk-6% Jk-6i ^ 7ra 

where (l48l follows since z(i)'s are Gaussian, z ~ J\f(fJ,,<J 2 ), i.e., / z (.) is the normal distribution. Hence 
in order to iteratively calculate the expected wealth growth of a TRP, we need to calculate the multivariate 
Gaussian integral given in (l48l . which is investigated in the next section. 



C. Multivariate Gaussian Integrals 

In order to complete calculation of the iterative equation in (fT2b . we next evaluate the definite multi- 
variate Gaussian integral given in (1481 on the multidimensional [k — 9\,K — 2 ] n space. We emphasize 
that the corresponding multivariate integral cannot be calculated using common diagonalizing methods 
[32]. Although, in (l48l . the coefficient matrix of the multivariate integral is symmetric positive-definite, 



21 



A Pseudo-code of QMC Algorithm for MVN Integrals: 



1. get S, a, b, N, M and a 

2. compute lower triangular Cholesky factor L for S, permuting a and b, and rows and columns of X for variable prioritization. 

3. initialize P = 0, N = 0, V = 0, and q = ^/p with p = (2, 3, 5, . . . , p^) where pj is the j-th prime. 

4. for i = 1,2,..., M do 

Ii = and generate uniform random A £ [0, l] k shift vector, 
for j = 1, 2, . . . , N do 

w = |2(jq+A)-l| , 

di = * te) ■ ei = * te) and /! = *-*■ 

for m = 2, 3, . . . , k do 

2/m-i = *~ 1 (rfm-i +w m _i(e m _i - d OT _i)), 



(C. = * 



e m = * 



a m-Z J „ = 1 lm.-n.Vj 
lm,m 

b m-12™=l lm.-n.Vj 



Jm — \pvn dm)Jm—l- 

endfor 

h = Ii + (/m - Ii)/j. 
endfor 

o- = (Jj - t)/j, P = P + cr, V=(i- 2)V/i + cr 2 and E = ay/V 
endfor 
5. output P ?3 $fe(a, b, S) with error estimate E. 



Fig. 3: A randomized QMC algorithm proposed in [33] to compute MVN probabilities for hyper- 
rectangular regions. 



common diagonalizing methods cannot be directly applied since the integral bounds after a straightforward 
change of variables depend on j/j. However, (|48T ) can be represented as certain error functions of Gaussian 
distributions. 

We note that the multivariate Gaussian integral given in (|48T ) is the "non-central multivariate normal 
integral" or non-central MVN integral [33] and general MVN integrals are in the form [33] 

1 f-bx rb 2 pb k 

r it I -i v tv-i. 



$*(a,b,El 



e 2 



■x T S- 1 x 



dxk ■ ■ ■ dx 2 dxi, 



(49) 



^|S|(2vr) fc J ai Ja 2 

where S is a symmetric, positive definite covariance matrix. In our case, (|48T ) is a non-central MVN 
integral which can be written in the form (|49l , where k = r — 1 and the inverse of the covariance matrix 
is given by 
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S" 1 



2 -1 

-1 2 -1 



-1 2 -1 
-1 2 
which is a symmetric positive definite matrix with |S| = 1, the lower bound vector is of the form, 

a= [oi,... ,a T -i] T , 

k — 9\ — \l 
k — 9\ — 2fi 

n — 9\ — (r — l)fi 
and the upper bound vector is given by, b = [b±, . . . , fe T _i] T , 

k - 9 2 - \x 
K — 9 2 — 2fl 

k - 9 2 - (t - 1)/J, 

where —kfi terms in the lower and the upper bounds follow from the non-central property of <148T >. We 
emphasize that the MVN integral in d49l ) cannot be calculated in a closed form [33] and most of the 
results on this integral correspond to either special cases or coarse approximations [33], [34]. Hence, 
in this paper, we use the randomized QMC algorithm, provided in Fig. [3] [33] for completeness, to 
compute MVN probabilities over hyper rectangular regions. Here, the algorithm uses a periodization and 
randomized QMC rule [35], where the output error estimate E in Fig. [3] is the usual Monte Carlo standard 
error based on JV samples of the randomly shifted QMC rule, and scaled by the confidence factor a. 
We observe in our simulations that the algorithm in Fig. [3] produce satisfactory results on the historical 
data [15]. We emphasize that different algorithms can be used instead of the Quasi-Monte Carlo (QMC) 
algorithm to calculate the multivariable integrals in (@8), however, the derivations still hold. 



IV. Maximum-Likelihood Estimation of Parameters of the Log-Normal Distribution 

In this section, we give the MLEs for the mean and variance of the log-normal distribution using the 
sequence of price relative vectors, which are used sequentially in the Simulations section to evaluate the 
optimal TRPs. Since the investor observes the sequence of price relatives sequentially, he or she needs 
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to estimate /i, and a at each investment period to find the maximizing b and e. Without loss of generality 
we provide the MLE for x\(t), where the MLE for X2(t) directly follows. 

For these derivations, we assume that we observed a sequence of price relative vectors of length N, 
i.e., (xi(l),xi(2), . . ., x\(N)). Note that the sample data need not to belong to N consecutive periods 
such that the sequential representation is chosen for ease of presentation. Then, we find the parameters 
yUi and a\ that maximize the log-likelihood function 

N 

lnC(fi 1 ,aj\x l (l),x 1 (2),...,x l (N)) = liif(x 1 (l),x 1 (2),...,x l (N)\n 1 ,aj) = ^2liif{x 1 {i)\fi l ,a 1 ), 

where f(x\fj,x, a() = — , 2 e ^ . The log-likelihood function in (1501 ) can also be written as 

N 2 

\nCQi 1 ,oi\x 1 (l),x 1 (2),...,x 1 (N)) = J2^ /x , e""" "■'--■"' 

i=\ X\(l)\J2-K(Jx l 

N 

= Vln ^ .yii!^™, (50) 



)V2ttoi 
1 ^ (lnxi(i) -fiif 



We start with maximizing the log-likelihood function ln£ with respect to /xi, i.e., find the estimator fli 
that satisfies g = 0. If we take the partial derivative of the expression in (|50l with respect to fi\, then 
we obtain 

<91n£ yr^lnxi(i) — Hi 

Hence fj,i, which satisfies M^- = 0, or the ML estimator jl\ of y,\, can be found as 

1 N 

1=1 

To find the ML estimator of the variance o\, we find a\ that satisfies i"^ = 0. Since [i\ that satisfies 
■M- = in (I5TT) does not depend on <rf , we can use it in (l50l . Let us define cci = X^i=i "^ f° r 
notational clarity. By replacing afi with /^i in (I5TT) and taking the partial derivative of the expression with 
respect to af, we obtain 

dlnC ' ' X 



N l V-^/i ^ -nS 



i=l 

Hence 

AT 



0a? 2^ 2{a{f 

1 * 



N 

8=1 
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Following similar steps, the ML estimators for X2{t) yield 

N 

N 



1 
^2 = ^^lnx 2 (i), (53) 



and 

N 



4 = ^T,(^x 2 (i)-x 2 ) 2 , (54) 



where x 2 = Y^i=i "at Note that the ML estimators /zi, erf, /i2 and a\ are consistent [36], i.e., they 
converge to the true values as the size of the data set goes to infinity, i.e., N — > oo [31]. 

V. Simulations 

In this section, we illustrate the performance our algorithm under different scenarios. We first use 
TRPs over simulated data of two stocks, where each stock is generated from a log-normal distribution. 
We then continue to test the performance over the historical "Ford - MEI Corporation" stock pair chosen 
for its volatility [12] from the New York Stock Exchange. As the final set of experiments, we use our 
algorithm over the historical data set from [10] and illustrate the average performance. In all these trials, 
we compare the performance of our algorithm with portfolio selection strategies from [10], [15], [27]. 

In the first example, each stock is generated from a log-normal distribution such that x\{t) ~ ln.A/"(0.006, 0.05) 
and x 2 {t) ~ lnA/"(0.003, 0.05), where the mean and variance values are arbitrarily selected. We observe 
that the results do not depend on a particular choice of model parameters as long as they resemble real 
life markets. We simulate the performance over 1100 investment periods. Since the mean and variance 
parameters are not known by the investor, we use the ML estimators from Section |IVJ which are then used 
to determine the target portfolio b and the threshold value e. We start by calculating the ML estimators 
using the initial 200 samples and find the target portfolio b = [b 1 — b] T and the threshold e that 
maximize the expected wealth growth by a brute-force search. Then, we use the corresponding b and 
e during the following 200 samples. In similar lines, we calculate and use the optimal TRP for a total 
of 900 days, where b and e are estimated over every window of 200 samples and used in the following 
window of 200 samples. We choose a window of size 200 samples to get reliable estimates for the 
means and variances based on the size of the overall data. In Fig. |U we show the performances of: this 
sequential TRP algorithm "TRP", the Cover's universal portfolio selection algorithm [10] "Cover", the 
Iyengar's universal portfolio algorithm [27] "Iyengar" and a semiconstant rebalanced portfolio (SCRP) 
algorithm [15] "SCRP", where the parameters are chosen as suggested in [15]. As seen in Fig. [4] the 
TRP with the parameters sequentially calculated using the ML estimators is the best rebalancing strategy 
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Performance of TRP under c=0.25 — Log-normally Simulated Market 
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Performance of TRP under c=0.1 — Log-normally Simulated Market 
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(b) 

Fig. 4: Performance of various portfolio investment algorithms on a Log-normally simulated two-stock market, (a) 
Wealth growth under hefty transaction cost (c=0.025). (b) Wealth growth under moderate transaction cost (c=0.01). 



among the others as expected from our derivations. In Fig. |4a] and Fig. [4bJ we present results for a mild 
transaction cost c = 0.01 and a hefty transaction cost c = 0.025, respectively, where c is the fraction paid 
in commission for each transaction, i.e., c = 0.01 is a 1% commission. We observe that the performance 
of the TRP algorithm is better than the other algorithms for these transaction costs. However, the relative 
gain is larger for the large transaction cost since the TRP approach, with the optimal parameters chosen 
as in this paper, can hedge more effectively against the transaction costs. 

As the next example, we apply our algorithm to historical data from [10] from the New York Stock 
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Performance of TRP under c=0.25 — Ford and MEI Corp. 
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Performance of TRP under c=0.1 — Ford and MEI Corp. 
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Fig. 5: Performance of various portfolio investment algorithms on Ford - MEI Corporation pair, (a) Wealth growth 
under hefty transaction cost (c=0.025). (b) Wealth growth under moderate transaction cost (c=0.01). 



Exchange collected over a 22-year period. We first apply algorithms on the "Ford - MEI Corporation" pair 
as shown in Fig. [5l which are chosen because of their volatility [12]. In Fig. [51 we plot the wealth growth 
of: the sequential TRP algorithm with the optimal parameters sequentially calculated, the Cover's universal 
portfolio, the Iyengar's universal portfolio and the SCRP algorithm with the suggested parameters in [15]. 
We use the ML estimators to choose the optimal TRP as in the first set of experiments, however, since the 
historical data contains 5651 days we use a window of size 1000 days. Hence, the performance results 
are shown over 4651 days. As seen from Fig. [5] the proposed TRP algorithm significantly outperforms 
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Fig. 6: Performance of various portfolio investment algorithms on Ford - MEI Corporation pair, (a) Wealth growth 
under hefty transaction cost (c=0.025). (b) Wealth growth under moderate transaction cost (c=0.01). 



other algorithms for this data set. Similar to the simulated data case, we investigate the performance of 
the TRP algorithm under different transaction costs, i.e., a moderate transaction cost c = 0.01 in Fig. l5bl 
and a hefty transaction cost c = 0.025 in Fig. [5a] Comparing the results from the Fig. [5a] and Fig. [5b] we 
conclude that the TRP with the optimal sequential parameter selection can better handle the transaction 
costs when the stocks are volatile for this experiment. 

Finally, to remove any bias on a particular stock pair, we show the average performance of the TRP 
algorithm over randomly selected stock pairs from the historical data set from [10]. The total set includes 
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34 different stocks, where the Iroquois stock is removed due to its peculiar behavior. We first randomly 
select pairs of stocks and invest using: the sequential TRP algorithm with the sequential ML estimators, the 
Cover's universal portfolio algorithm, the Iyengar's universal portfolio algorithm and the SCRP algorithm. 
The sequential selection of the optimal TRP parameters are performed similar to the previous case, i.e., 
we use ML estimators on an investment block of 1000 days and use the calculated optimal TRP in the 
next block of 1000 days. For each stock pair, we simulate the performance of the algorithms over 4651 
days. In Fig. [6] we present the wealth achieved by these algorithms, where the results are averaged over 
10 independent trials. We present the achieved wealth over random sets of stock pairs under a moderate 
transaction cost c = 0.01 in Fig. [6b] and a hefty transaction cost c = 0.025 in Fig. [6a] As seen from 
the figures, the TRP algorithm with the ML estimators readily outperforms the other strategies under 
different transaction costs on this historical data set. 

VI. Conclusion 

In this paper, we studied an important financial application, the portfolio selection problem, from 
a signal processing perspective. We investigated the portfolio selection problem in i.i.d. discrete time 
markets having a finite number of assets, when the market levies proportional transaction fees for both 
buying and selling stocks. We introduced algorithms based on threshold rebalanced portfolios that achieve 
the maximal growth rate when the sequence of price relatives have the log-normal distribution from the 
well-known Black-Scholes model [8]. Under this setup, we provide an iterative relation that efficiently 
and recursively calculates the expected wealth in any i.i.d. market over any investment period. The terms 
in this recursion are evaluated by a certain multivariate Gaussian integral. We then use a randomized 
algorithm to calculate the given integral and obtain the expected growth. This expected growth is then 
optimized by a brute force method to yield the optimal target portfolio and the threshold to maximize the 
expected wealth over any investment period. We also provide a maximum-likelihood estimator to estimate 
the parameters of the log-normal distribution from the sequence of price relative vectors. As predicted 
from our derivations, we significantly improve the achieved wealth over portfolio selection algorithms 
from the literature on the historical data set from [10]. 
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